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(57) Abstract 
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i 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


Ft 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


UJ 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Henegovma 


GB 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HI) 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


It 


Ireland 


MN 


Mongolia 


LA 


Ukraine 


BR 


Brail 


1L 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Iidy 


MX 


Mexico 


uz 


Uzbekistan 


CP 


Central African Republic 


JP 


Japan 


NK 


Niger 


VN 


Viet Nam 


CC 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


CI 


COte d'Tvoire 


KP 


Democratic Ptopfc'i 


NZ 


New Zealand 






Cm 


Cameroon 




Republic of Korea 


PL 


Polwid 






CN 


China 


KR 


Republic of Korea 


FT 


Portugal 






cu 


Cuba 


KZ 


Kazakitan 


RO 


Romania 






C2 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Fedemion 






DB 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


SriUnka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 98706872 PCT/IB97/00976 

1 

METHOD OF COMPUTER-AIDED AUTOMATED DIAGNOSTIC 

DNA TEST DESIGN 

The present invention relates to techniques for designing comprehensive DNA 
dinmiostic tests for imitations in disease genes; being more particularly directed to 
computer-aided searching for optimized predicted conditions for Polymerase Chain 
Reaction (PGR) Amplification, including multiplex PCR amplification, optimal melting 
behavior in denaturing gradient gels and optimal distribution by two-dimensional 
electrophoresis of the mutational target fragments, thereby to provide an overall design for 
actual diagnostic tests. 

BACKGROUND OF THE INVENTION 

As is well-known, a strand of DNA is comprised of four different nucleotides, as 
determined by their bases: Adenine. Thymine, Cytosinc and Guanine, respectively 
designated A. T, C. G. Tor each strand of DNA. there is a homologous strand in which A 
pairs with T, and C pairs with G. A specific sequence of bases which codes for a protein 
is referred to as a gene, and that gene is segmented into regions which arc responsible for 
protein compositions (exons) and regions which do not contribute to protein composition 
(introns). An exon can vary in length from about 30 bascpairs to thousands of bascpairs 
(bp). For purposes of the present invention, however, primary concern resides with the 
coding regions of the gcnc(cxons), though the test procedures of the invention can be 
easily appropriately modified, where desired, to screen for carcinogenic mutations in the 

introns, also, and for other purposes as well. 

DNA diagnostic testing of genes with mutational defects is important for 
ascertaining information with rcgatxl to individual susceptibility to particular diseases, 
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classification ol the disease in a therapeutically relevant .sub-group, the carricrship (and 
potential transfer) or prenatal presence of birth defects, and other important purposes. 
The gold standard in DNA diagnostic testing for the presence of mutations is DNA 
sequencing, which involves the complete decoding oflhc gene. This, however, is time- 
consuming and inefficient. Time-consuming because despite numerous ongoing attempts 
to simplify and greatly accelerate the process, alt currently used routine sequencing 
systems arc still based on the principle originally developed by Sanger and colleagues (as 
described, for example, in Molecular Pathology , Mcim and Silverman. 1990. pages 7-10), 
and only partially automated. They arc inefficient because, instead of simply determining 
differences between entire DNA fragmcnls. every DNA fragment must be completely 
decoded before any differences from normal (mutations) arc revealed. The fact that the 
exact location ol a disease-causing imitation can be different from one individual to the 
next, moreover, precludes the possibility of only testing for frequently occurring known 
mutations. Indeed, the many different mutations that may convert a healthy gene into a 
diseased gene makes it necessary in each case to inspect the entire gene at all possible 
positions for mutations (termed gene scanning) ralhcr than for only a few frequently 
occurring ones (imitation screening). Mutation screening methods arc relatively simple 
and cost-efficient. OuUidc DNA sequencing, potential gene scanning systems arc scarce 
and their cost-efficiency is questionable. 

Recently, a method was disclosed by one oflhc applicants of the present patent 
application Tor comparative scanning of 100-600 bascpairs (bp) gene fragments by 
multiplex PCR amplification followed by two-dimensional clcctrophorctic separation in 
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polyacrylamidc gels on the basis or both size and bascpair sequence; "Multiplex Co- 
amplificalioii oF24 Retinoblastoma Gene Cxons After Pre- amplification By Long-Distance 
PCR," Jan Vijgand Daizong I.L Nucleric Acids Research . 1996, Vol. 24, No. I. p. 538-9; 
"Two-Dimensional DNA Typing", Jan Vijg. Molecular Bio technology, pages 275. on. 
Vol. 4. 1995; and in copending US patent application Serial No. 08/471,249. filed June 6, 
1995 lor Method (X And Apparatus For Diagnostic DNA Testing. The latter can be 
accomplished by denaturing gradient gel clectiophorcsrs^DGGE:). This multiplexing 
technique for analyzing predetermined gene cxons derived from DNA. involves adding 
primer pairs surrounding successive groups of the gene cxons followed by effecting long- 
distance PCR amplifications thereof" in a common tube or vessel (multiplex long-PCR) to 
achieve relatively long resulting amplicons; adding further primer pairs surrounding each 
of the gene cxons or parts thereof, and then effecting multiplex PCR amplifications thereof 
in the common tube or vessel with relatively short resulting amplicons; and 
clcctrophorctically separating the gene fragments. By size separation, mutations 
representing deletions or insertions varying from several to many bascpatrs can be 
detected. In DGGE. point mutations, such as bascpair substitutions arc also delectable. 
This is due lo the tendency of double-stranded DNA fragments to melt at a point in the 
gradient where the temperature equals the melting temperature of the lowest-melting 

domain of the fragment. 

In the process of PCR amplification, the mutati onal target fragments (e.g., gene 

cxons) arc surrounded by primers, i.e., short (v,bout 20 base pair^in^ DNA 



fragments. Primers ate chosen in such a way that they arc complementary (bind to) 





WO 98/06872 4 PCTA1B97/00976 

positions ;il the lett and right boundaries or ends oflhc target fragments. By using 
appropriate enzymes that extend each primer inwards, towards each other, the mutational 
target can be copied. This can be repeated a great number of times in a .so-called 
ihcmiocyclcr - - a machine that first heals up the DNA sample, thereby separating the 
single strands followed by cooling down, which results in annealing of the primers to their 
target sequences, and the subsequent enzymatic extensions of the primers by polymerase 
enzymes. The net result is on amplification of the fragment in between Ihc primers of 
typically one million times. This provides enough target DNA to detect the fragment by 
using a DNA-binding dye, after clcctrophorctic separation, without the use of radioactive 
tracers: the test of the DNA being now a relatively small amount and invisible. 

The poMliomnj^ because, for such short sequences, there is 

ample opportunity to bind elsewhere in the complex DNA molecules that form the starting 
material oflhc test. This would lead to the copying of other fragments than the ones of 
interest. Positioning of primers is even more critical in denaturing gradient gel 
electrophoresis, where each fragment must have an optimal melting temperature in order 
to allow all possible mutations to be delected. It is common practice to couple one oflhc 
two primers surrounding a gene target fragment to a GC-rich clamp sequence of about 30 
bascspait.s long. This clamp is very stable and functions as the highest melting domain: 
that is, the part oflhc DNA molecule that keeps the fragment together. This is important 
because once a fragment migrates off the gel it can no longer be detected. In PCR-DGGE 
it is just as important, furthermore, that the target fragment consists of one single domain 
(flat throughout the gene fragment) that has a lower stability than the GC-clamp. In that 
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case, il will melt earlier than the clamp, resulting in a structure that is partially double- 
stranded ((tic clamp) and partially single-stranded (the target fragment): n so-called 
branched .structure. Such a fragment will be greatly retarded in the gel. Typically, the 
exact position where such a fragment melts (and thus halts its migration) is completely 
dependent on its sequence. With a fragment of. say, 500 bascpairs. one single bascpnir 
difference will lead to a migralional difference that can be employed by detecting 
mutations in such fragments as a migralional difference with a control (wildtypc) 
fragment. I lence, in contrast to DN A sequencing, DGGE docs allow comparative 
scanning or whole fragments for mutational differences without the need to completely 

decode each motccuc. 

To cany out DGGE in two dimensions (2-D) rather than in one, increases the 
efficiency of the system as well as its reliability. Indeed, in a 2-D gel. many more 
fragments can be analyzed simultaneously than by a I D separation. It is more reliable 
because every fragment can be defined by both its melting temperature and its size. A 
disadvantage, however, resides in its increased complexity, requiring extreme attention to 
the design of the test. Since the design of a DGGE test itself is not trivial (primers must 
be chosen in a way that the amplified fragments represent a single domain, as before 
discussed). ihc 2-D principle adds a dimension in complexity as well as in resolution. A 
typical design of PCR primers, moreover, must lake into consideration many other 
variables inherent to the PCR process, sucli as primer annealing temperatures. 

As disclosed in the above-cited Vijg papers and the said patent application, Ihc 

r 

i 

two-step PCR process enables many different exons to be amplified simultaneously in the 
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same reaction. In this method, first, groups or target Fragments (e.g.. groups of exons) arc 
amplified as large 5-40kb amplicons, for example, by long-distance PCR (an improved 
fonn of PCR in which more efficient polymerase enzymes arc employed that generate 
longer fragments). Second, with these amplicons as templates, large numbers of individual 
target sequences can now be amplified simultaneously in the same reaction vessel under a 
single set orexpcriincnlnl conditions. Normally it is very difficult to find one set of 
reaction conditions under which multiple fragments specified by multiple primer pairs arc 
amplified simultaneously (i.e.. multiplex PCR). Probably because the prc-amptification by 
long-distance PCR increases the amount of target sequences relative to the rest of the 
complex genomic DNA. flexibility with respect to the subsequent PCR amplification of the 
target sequences is much greater than normal, which permits extensive multiplexing; i.e., 
co-amplification of different target fragments in the same reaction. 

This method or extensive multiplexing greatly economized the process of 
template preparation in genetic testing over earlier techniques for inspecting for mutations 
after PCR amplifying the many distributed exons of the often very large disease genes. As^ 
an example, it is now possible, to generate a collection of as many as 25 fragments 
corresponding to 26 exons of the tumor supprcsscr gene RBi in one single two-step PCR 
reaction. After the two-step multiplex PCR amplification, a single 2-dimcnsionat 
clcclrophorctic separation, as the third step, is sufficient to resolve all these fragments and 
detect all possible mutations as variations in fragment spol position. With this system, a 
genetic testing method has become available that is both highly accurate (in delecting all 
possible mutations) and cost-efficient. 
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The only remaining drawback involves (he absence of a rapid design of an optimal 
Icsl for one or mote individual genes involved in a particular disease. The I mmnnj jc noinc. 
indeed, contains an estimated number of 100,000 different genes, many of which might 
ultimately prove to be involved in one or more diseases. To design a set of PCR primers 
for many different genes and/or gone combinations that fulfill criteria for optimal 
(multiplex) PCR, optimal denaturing gradient clcctrophorctic separation and 2-D 
distribution is not trivial. As shown in later-described Fig. I, the computer-assisted test 
design of the invention must provide, for an optimal 2-0 genetic test for one or multiple 
genes with predicted primer and GC-cIamp respective positions and lengths for meeting 
optimal melting criteria, PCR criteria and 2-D spot distribution criteria. 

The present invention is directed to solving this problem and, through a computer- 
assisted procedure, to scini-automntically n nil/or automatically design multiplex PCR/2-D 
clcctrophorctic tests for one or more genes. 
OBJECTS OF INVENTION 

An object of the invention, accordingly, is to provide a new and improved method 
of and apparatus for computer-aided rapid design of a genetic test based on multiplex PCR 
amplification of the target sequences, followed by clcctrophorctic separation of the 
iraumcnt mixture in two dimensions: size and bascpair sequence. 

A further object is to provide for superior performance of multiplex PCR/2-D 
electrophoresis tests through the heretofore unavailable rapidity in computer software- 
aided selecting of optimal sets of conditions among the large number of possibilities 
available. 
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Other and further objects will be described hereinafter and arc more particularly 
delineated in the appended claims. 
DESCRIPTION OF THE DRAWINGS 

The invention will now be described in connection with the accompanying 
drawings in which: 

Fig. I. as earlier mentioned, is a block diagram of the underlying computer-assisted 
system for optimal 2-D genetic lest design for one or multiple genes: 

Pig. 2 represents experimentally clcctiophorctically obtained mutation displays of 
exemplary l-D and 2-D DOGE tests on illustrative BRCA-I gene fragments; 

Pigs. 3, 4 and 5 arc melting graphs (melting temperature vs. bascpair offset) 
respectively of unacceptable multiple domain and acceptable left and right end GC- 
clampcd single domain profiles, taken for an exemplary retinoblastoma gene RB Exon 21. 
Fragment U 1 : 

Figs. 6A and 6B present flow diagrams of the present-day sequence of process 
slcps required for determining appropriate primers and GC clamps for clcctrophorctic 
testing of designated gene fragments: 

Pigs. 7 A and 7B are similar How charts of the steps involved in scmi-automaticnily. 
will) operator assist, rapidly designing appropriate test primers, GC clamps and melting 
profiles and providing simulated clcctrophorctic gene separation displays in accordance 
with the computer-aided software techniques of the present invention; 

P^s. RA and RB arc How charts similar to Fius. 6A and 6B for full automation. 
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l'*ig. 9 shows a preferred computer display screen with the various monitoring and 
display icons, boxes or windows* and options available to the investigator;. 

Fig. 10 shows this screen with inputs for n particular desired gene— in this 
illustration. P53 involved in colon and other ennccrs. assuming no prior knowledge of 
appropriate primers, temperature gradients, etc.; 

Pigs. 1 1 A and I IB arc similar views with different test values of primers. (JC 
clamp, etc. producing unsatisfactory (sicmifa(cd) niching profiles with multiple domains. 
Fig. 1 1 A with the clamp located on the right, and Fig. I IB. on the left; 

Fig. 12 shows a satisfactory melting profile for corrected test values of primer, GC 
clamp, etc., with Figs. I ) \, 1 1 13 and 12 showing variation of predicted melting curves of 
cxohs of the gene obtained with only slight variations in the positioning of one or more of 
PCR primers, lengths of primers and O'C-clamp. type of GC-clamp, and length of the 
fragment, which serves to illustrale the real and qualitative features of the invention with 
respect to the final test design, and with Fig. 12 containing an additional column read-out 
of temperatures to the right of the graph; 

Fig. 13 illustrates displays of sequences with inciting temperature; 

Fig. 14 shows a predicted 2-D display that the design of the invention will enable 
to be produced in actual 2-D electrophoresis operation in accordance with the design; and 

Fig. 15 is a time graph for test design for gene RBI comparing the prior manual 
procedures with those of the invention. 
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SUMMARY OF THE INVENTION 



In summary, the invention embraces a method of computer-aided genetic test 

design that involves capture of a gene sequenc e file, e.g., from Gcnb ank: indicating the 

1 * ■ — ^ . — 

start mid ends of desired target sequences; generating optimal melting profiles, I'CR 
conditions and 2-D distributions by varying (I) fragment size (gap-size), (2) GC-clamp, 
and (3) primer size: and reporting the selected optimal primer sets for being synthesized. 

From a more detailed viewpoint, the invention embraces a computer-assisted 
method for generating the design of optimized and comprehensive genetic DNA 
diagnostic tests based on l'CR/2-D electrophoresis for screening mutations in genes, that 
comprises, inputting a computer microprocessor with a desired gene exon fragm ent 
ATGC letter sequence; programming and controlling the microprocessor first to select a 
trial start a nd end of the sequence ; then, s electin g and p ositioning t rial primer letter pairs 
to define U icjjppo^ to be tested: adding n trial GC clamp to at 

least one primer: generating trial melting profiles for the trial primers and GC clamp to 
determine whether the profile consists of a relatively high-melting GC clamp peak and a 
substantially single fiat domain of relatively low-melting value for the fragments; in the 
event that the trial profile has multiple and not a satisfactory single flat domain, changing 
one or both of other trial primer sequences and GC clamp sizes and generating said 
profiles therefrom until such a sutislactory single Hat domain is attained, and for primers of 
each pair having substantially similar annealing temperatures suitable for PCR annealing; 
comparing Ihc primer letter sequence with other known gene sequences in the context of 
the total letter sequence of the gene exon to insure the uniqueness of the sequences and to 
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avoid overlap with any other than the selected gene fragment; simulating a 2-D 
distribution oT the fragments defined by the primer pairs by separation along one 
dimension by size and along an orthogonal dimension by melting temperature of the 
fragment; selecting a gradient of increasing melting temperatures along the orthogonal 
dimension to allow an optimal spread of (he fragments thcrcafong; and repotting the 
selected primer sets and temperature gradient, thereby providing an optimal design for an 
actual PCR/2-D electrophoresis test. 

Preferred and best mode designs and details arc hereinafter fully described. 
DESCRIPTION Or PREFERRED EMBODIMENTS 

The present invention, as before stated, involves a procedure for the semi- 
automated and/or automated computer-aided design of optimized genetic screening tests 
based on PCR72-D electrophoresis, and preferably multiplex PCR/2-D electrophoresis. 
Brief Summary Of Preferred Diagnostic Procedural Techniques 

In accordance with the procedures useful also with the computer-assisted semi- 
automated and automatic features of the invention, two DNA fragments of, for example, 
I l-50kb arc constructed which arc homologous to an I R to 25 base sequence in the 
intronic sequence both preceding and following an exon; (he leading and trailing primers. 
Because many mutations occur in the transition from intron to exon, these priming sites, 
as before explained, arc separated from the exon by a gap of at least, say, 10 base pairs, 
and usually not more than about 40 base pairs. For each "multiplex" test herein 
described, optimal printers for all of the fragments would have comparable annealing 
temperatures, as previously noted. If the primers all anneal within a few degrees 
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centigrade of one another, each fragment should be equally amplified in the multiplex 



Once primers ate selected, the exon is amplified from template DNA by PCR. 
1 he DNA sequence which is amplified was earlier referred lo as a PCR fragment or on 
amplicon, including the exon, the primer sequences, the intronic gaps between the exon 
and primer sequences, as well as usually a 30-60 base pair GC clamp, for example, which 
will be discussed later in reference lo the computer software programming of the present 
invention. With such operation, the PCR fragment is at least about 86 base pairs longer 
llum the exon itself. Following a successful multiplex PCR, millions of copies of the 
desired fragments exist. These can be viewed on a gel following electrophoresis on both 
size and denaturing gradient gels; it being important that each amplicon has its own unique 
spot on the 2-Dimcnsional mapping. In other words, the fragment should have a size an d 
melting temperature unique to nil other fragments included in the same test. The 
computer software of the invention, accordingly, is designed, as later detailed, to enable 
the user more readily lo attain a test in which these criteria arc optimally met. 

With regard generally to the software design for achieving the purposes of the 
invention, an optimal test design would insure (hat each fragment would have a different 
size as well as i different Tm or melting temperature. Using the later-described program, 
the fragment length can be varied by adjusting the length ofjiyc-prirncr and by adjusting 



the position of the primer. Primer length for short PCR, as before described, generally 



ranges from about 18-25 bases pairs; and primer position is usually between about 10 and 



PCR. 
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30 base pairs from the beginning and end or the exon. This is also described as the bclorc- 

menhoned gap of. say, IO-30bp. 

To determine (he Tin of the fragment, the computer uses the Lcmian Program 
(described, for example, in "Computational Simulation of DN A Melting and Its 
Application to Denaturing gradient gel Electrophoresis," Leonard S. Lcrman and 
Silvcrstcin, Mr.hnds In Fiizvmotoev. pages 482-501, Vol. 155, 1987), which will produce 
a theoretical Tm for a fragment. %UF (uica-fotmamidc) is a chemical gradient from 
which the actual melting temperature Tm in degrees C can be determined. Tm. moreover, 
is completely dependent upon base sequence. An A-T bond is less stable than a C-G 
bond. If n fragment has a high C-G content, therefore, it will melt lower in the gel (high 
Tm). Likewise, a fragment with a high A-T content will have a lower melting temperature 
and will melt sooner (lower %UF) in the gel. Because the bases have an effect on 
neighboring bases, a single mutation, called a point mutation, can alter the To of the entire 
fragment enough that it can be detected by denaturing gradient gel electrophoresis. For 
example, a point mutation in which a single T is changed to a G, is enough to change the 
melting temperature of the entire fragment from, for example, say 56% UF to 58% UF. If 
an A or a T changes to a C or a G, the mutant will have a higher Tm than the latcr- 
described healthy wild-type, due to the increased stability of the C-G bond. By similar 
reasoning, if a C or a G is mutated to an A or a T, the mulatit will melt at a tower %UF to 

To. 

At the end of the PCR, a special cycle is introduced in which the DNA is denatured 
and then allowed to bond with a complementary strand. Because evety individual 
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possesses two double stranded alleles or copies of the same gene, one from the mother 
and one from the father, four combinations of alleles can occur during this cycle. If a 
mutation is present in one of the alleles, it is referred to as the mutant strand. The four 
possible combinations would then be wild-type with wild-type, mutant with mutant, (these 
two arc referred to as homoduplcxcs) and wild-type with mutant and mutant with wild - 
type (referred to as hctcroduplcxcs). 

Using the terminology mother (M) for the maternal allele and father (F) for the 
paternal allele, the four strands can be called MM and FF for the homoduplcxcs. and MF 
and l : M for the hctcroduplcxcs. When a mutation is present, the hctcroduplcxcs will be 
notably less stable than cither or the homoduplcxcs. If, for example, a point mutation 
from a T to a G has occurred, as above described, in the hetcroduplex strands, there will 
be a destabilizing mismatch of A with C and T with G. The mismatch will cause the 
hctcroduplcxcs to become far less stable than cither of the homoduplcxcs. The presence 
of a mutant allele will accordingly produce four spots in a 2-D pattern. The mutant and 
wild-type homoduplcxcs arc separated due to the nature of the mutation, and the two 
hctcroduplcxcs will melt considerably higher in the gel due to their destabilizing mismatch. 
In the above example of a T to G point mutation, the wildtypc homoduplex will melt at 
say, the before -mentioned 56% UF, the mutant homoduplex will melt at 58% UF. and the 
two hctcroduplcxcs will melt considerably above, but at melting points different from each 
other. 

Thus, in the example of a BRCA-I mulation/hclcroduplcx shown in Fig. 2, two 
control samples with the same polymorphism and a sample with a 1 -bp insertion mutation 
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detected by l-D DGGE (ptiotogrnplis A to (lie left) and by 2-D DGGE (photographs B to 
the right) illustrate how mutations arc revealed in the system by the appearance of four 
bands and spots, respectively, instead of one, and also show how the mutations or 
polymorphisms can be clearly distinguished from one another. 

In accordance with Ihc present invention, the computer helps determine a test in 
which each fragment produces a distinct spot on (tic 2-D pattern in which a hetcroduplex 
at that position can be easily detected. 

The earlier mentioned Lcrrnan's Program ("Melt 87") calculates the melting 
temperature of each base pair. Then a program *'SlidcWritc" (described in the Advanced 
Graphics Software. Inc. Manual ''SlidcWiitctT?) Plus for Windows™.'* 1994) is used to 
construct a melting graph in which the different melting domains of the fragment can easily 
be viewed. The generation of a CG clamp at one end of the fragment can influence the 
remainder of the strand to act as a single melting domain. Without such clamp, there arc 
generally varying or multiple levels or domains for the fragment (Fig. 3). In Ihc gel, a 
fragment with a GC claiup-induccd single melting domain will melt, but the GC clamp 
itself, which melts considerably higher, will remain intact, as shown by the left-hand spike 
or peak in Fig. A. Without a GC clamp, the fragment will simply split into two single 
strands and run off the get. With the intact GC clamp, one strand enters a pore in the gel 
and the other enters another pore in the gel, and the intact GC clamp inhibits the fragment 
from moving any further in the gel. On the computer, a GC clamp oC for example, 30, 35 
or 38bp can be selected. The user can also choose to which primer the clamp should be 
added: Fig. 5 showing the clamp spike or peak to the right. The optimal melting pattern 
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for a given frogmen! is arrived at through trial and error by selecting different primer 
lengths and positions and then trying different types and positions of GC clamp. 
Operation Before Hie Present Invention 

Before the present invention, the user would enter the DNA sequence. Then 
Lcrmau's program would begin asking questions. After the user had answered all the 
questions, the computer would calculate the melting temperature. After the melting 
temperature was arrived at, the outcome had to be transferred to another program which 
would construct a melting map of the fragment. If the fragment was not optimal, the user 
would return to the Lcrman Program and alter the primer position/length manually, and 
determine where they had been placed on the first run. Once the user had achieved 
adequate melting maps for each of the fragments, a 2-D pattern could be constructed. If 
each spot did not have a unique position on the 2-D pattern, the user would have to start 
all over with whichever fragments overlapped. 

Using this system, it took about four weeks to design a test for RBI, though still 
sub-optimal. With the program of the invention, a more optimal test was designed in 
about four hours. Tor more genes participating in the same test, design problems using 
MELT 87 became increasingly severe. The point comes at about 30 fragments when it is 
no longer possible to design manually according to the Lcmian program. 

Fig. 1 5 plots the time in hours as a function of number of fragments for test design 
for RBI of prior techniques and those of the invention. 

Curve A depicts data taken during the construction of the two-dimensional DGGE 
test for RBI with fully manual software. 160 man hours were necessary to design a test 
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which could adequately test the 27 cxuitsol the retinoblastoma gene. Manual test design 
has an empirically determined capacity limit of about 40 fragments. Curve B. on the other 
hand, exhibits the drastic improvement in both time efficiency and test capacity 
experiences with semi-automated test design of the invention. With such semi-automated 
test design, the 27 exon RBI lest was redesigned in approximately two and a half hours. 
Curve C. furthermore, predicts the exponential improvement Rilly automated test design 
software of the invention can have on the construction of two-dimensional multiplex tests, 
extending the capacity of two-dimensional multiplex gene testing even to the theoretical 
resolution limit of 2200 fragments on a 15 cm x 15cm denaturing gradient gel. 

The program cX the invention, furthermore, is able to calculate the Trn or annealing 
temperatures of the primers which also greatly facilitates the development of a megaplex 
PCR. in the past, the user had to consult yet another program for this information. 
Lastly, the program herein also searches the gene sequence to insure that each primer 
sequence is unique so that only the desired PCR fragment will be amplified. For this 
purpose, only the long-PCR fragments have to be searched through. Before this feature, 
the user would have to conduct a time-consuming blast search on the Internet in order to 

insure the uniqueness of the primers. 

A typical scries of these earlier time-consuming steps for finding appropriate 
primers, GC clamps, etc., and melting profiles, is presented in the Flow Charts of Figs. 6A 
and 6B in connection with the illustrative case of retinoblast o ma genes ( RBG), selected 
as at I , Fig. 6A. After finding the numb:r of exons and starting with the first (2). the 
bcuinnmii and end locations or the exon arc sought (3). If the number of base pairs is not 
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greater than, for example, 500 letters (ATGC), the exon string is increased by at least, for 
example, 18 on each end. to create primers (5); but if there arc more than 500 bascpairs. 
they are broken up into smaller sections, with overlap as at (4). GC clamp sequences arc 
added on the header and trailer (f>> with stripping and reordering to leave 6 groups of 
separated ATCG sequences, and with a new exon tile. Melting profiles arc then run six 
limes, three with three types of GC clamps on the leading primer and three on the trailing 
primer, at (7); and then corresponding graph profiles arc generated at (8), Fig. 6B. such as 
arc shown in Figs. 4 and 5. If the melting profile for those primer sequences produces a 

i 

{ satisfactory flat single domain at (9), as in Figs. 4 and 5, using a search of the world wide 

web (www), uniqueness is determined at ( 10). if an unsatisfactory multiple or other 
! unsatisfactory domain profile is obtained, however, as in Fig. 3. the primer sequence 
| length and/or GC chimp size (or an added GC clamp) arc varied, or the splitting of the 
exon fragment (II). The latter variations arc also performed ( 1 I ) if lack of uniqueness 

, amongst the data base is found at (10). This is then followed by reformatting ( 1 2). 

I 

recording new GC clamp positions, etc., and rc-running the melting profiles with variation 

j 

of start/end positions, etc.. as required (13). A determination of substantially the same 
annealing temperature for the primers, as required for PCR sequencing, is then made (14), 
with variations as at ( 11 ) effected if the primer temperatures arc not substantially matched. 
The melting temperatures arc then calculated from the percentages of urea formamidc by 
the formula (15), enabling the making of a melting temperature versus number of bascpairs 
for all exons complete ( 1 6). Should the present exon of interest be located too near any of 
the others, the variations of step (II) arc employed to provide greater separation. If, 

I 
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however, there is no crowding, u table entry for the exon. including its number. GC clamp 
location, primer letter code and total length of letter code, mid melting temperatures Tm 
arc all recorded (17). 

If there are more exoits (IK). Ihc beginning step 3, Fig. 6A, and subsequent steps 
are repeated; otherwise, the procedure for exon primer determination is completed ( 19). 
Operation In Accordance Willi The Invention 

The computer-aided software-controlled microprocessor programming that 
underlies the semi-automatic or automatic exon primer determination procedure of the 
present invention, greatly speeds this design process, and with much more facility and 
flexibility. 

A semi-automatic version is shown in the microprocessor sofiwarc implemented 
Flow Chart of Figs. 7A and 7B, later described, controlling the computer display screen of 
Figs. 9 1o 14. The screen display format, as shown in Fig. 9, provides boxes or icons, 
such as "window" displays, an automatic plotting or graphing display region for simulated 
trial melting temperature profiles overall the gene fragments, and an automatically 
presented simulated 2-D gene fragment spot distribution presentation (2-D) of the type 
attainable by actual 2-D clcctrophorclic separation, as described in the said Vijg papers. 
Description Of Display Screen 

In (he lower right of the screen is the 2-D pattern-distribution display area, labeled 
"Migration Cli," in which the melting temperature (in %UF) of the fragments is plotted as 
a function of the size of the fragments. The purpose of this migration chart/pattern is to 
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insure that no Iwo fragments will occupy the same place in Ihc pattern, and thus mutations 
in alt of the fragments will be detectable. 

Above the Migration Ch area, under the title "Gene View", there is presented the 
window for displaying the sequence of the entire gene. The user can load the gene 
sequence into the program so that it is later possible to scroll through the gene in this 
window (o locate the exons, as in capital letters, and the nitrons as in lower case letters. 
The Gene View makes it easy to locate the beginning and end of the exons and indicates 
to the user the total length of the fragment (i.e. exon. gaps, primers, and GC clamp). 

On Ihc left -mind lower and central portion of the screen, there is provided a 
"melting temperature vs. base pair offset" mapping area: and between this mapping area 
and the Migration Ch area, there is a tabular column space in which the individual melting 
temperatures Tm of each base pair is printed and scrollable. 

In the upper center, boxes for indicating the selected number and fragment and the 
"exon length" arc provided, defining the selected exon/fragmcnt. The number the user 
enters here refers to Ihc fragment for which the user is designing a test. Sometimes an 
exon is too long to analyze in a single fragment and thus must be divided into different 
fragments. For example, if exon 3 of a given gene needed to be divided into three 
fragments, the user would number the fragments 3.1, 3.2, and 3.3. These numbers would 
rend exon 3 fragment I, exon 3 fragment 2, and exon 3 fragment 3. To the left and right of 
this box. arc windows or boxes labeled "gap", which arc the boxes that tell the user the 
length of the gap between the end of the primer and the exon sequence. The box to the 
left indicates the gap between the end of the leading primer and the beginning of the exon, 
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while the box to the right indicates the gap between the end of the exon and the beginning 
of the trailing primer sequence. 

To the Iclt ;ind right of the gap-length boxes, arc boxes labeled "leading primer" 
and "hailing primer, respectively. The numbers here represent the length of the primers, 
usually between I R and 25 base pairs in length, as before described. 

On the left and right of the primer boxes, there arc also provided left and right 
boxes labeled "GC Clamp". The user can click on these boxes to select whether to place 
the GC clamp on the leading or trailing primer. Beneath the GC clamp box is the word 
"type \ referring to the type of GC clamp that best suits a given fragment. The numbers 
I 2 and 3 arc assigned to correspond to the earlier-described illustrative 30, 35 and 38 
bascpair clamps. 

Below the primer boxes, arc boxes "anl" for indicating the Tin of the primer 
sequences, which arc referred to as the annealing temperatures of the primers. This 
function is especially useful in designing the before-described multiplex tests in which the 
annealing temperatures of the primers or the different fragments should optimally be 
within a few degrees of each other, as earlier stated. 

Directly under these "anl" boxes, in turn, further boxes arc provided in which the 
words "unique" or "not unique" appear. This refers to whether or not the given primer 
sequence occurs somewhere else in the gene. The function of this is to ensure that each 
amplicon has a set of primers which do not share sequence homology with any other 18- 
25bp sequence, so that only the desired fragments will be amplified. 
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The box thai reads "Melt" at the lop line of the display, when selected, commands 
the computer to calculate the 'I'm and construct a melting map of the fragment for the trial 
parameters which the user has indicated (primer lengths, etc.). Figs. 10-12. If the melting 
map is optimal, as in Tig. 12, and the fragment occupies a unique locution on the 2-D 
migration map, tiic user clicks on the box marked "Save Primers" at the top-line right. 
This will save all of the information concerning a given fragment; i.e. length of primers, 
gap length, GC clamp type and location. The information wilt be saved under whichever 
heading the user selected in the box designated the exon "fragment". 

The box Reset Primers will reset the computer to the standard conditions of, for 
example, a 30 bp gap length and I R bp primer length with no GC clamp. 

"Restore Primers" recalls saved information concerning a given primer. The user 
types the exon fragment information in the exon fragment box and clicks on the Restore 
Primer box. If the user types in 3.2, for example, the computer will recall the primer 
positions, gap lengths. GC clamp location and type of exon 3. fragment 2. 
Seiui-Autoitialic Operation 

Returning, now, to the semi-automatic version of operation in accordance with the 

invention, the operator inputs or controls, by a conventional "mouse" or the like, the 
sequencing of the program. The general procedure for the computer-aided design of 
optimized genetic screening tests based on such PCR/2-D electrophoresis (and including 
multiplex operation, as before discussed) is as follows: 



WO 98/06872 21 ^ PCT/EB97/00976 




Preliminary 

I . Retrieve Ihc gene sequence code (e.g., froi^Gc^ibank). While the gene sequence code 
is n four-letter code (that is, a particular combination of A's. T's. C*s and G's), the 
complete code thai specifics a gene can be as long as 100.000 letters. Only small parts, 
however, each not more than 100-600 letters long, arc considered important for making 
the test. These parts arc the exons, and indications arc given as to where they start and 
where they end in the tola! letter code. Small parts before and after each exon (splice 
sites) arc also included as well as so-called regulatory regions close to the actual gene or 
some distance apart. 

2. Find the exons in the letter code or specify other target fragments. 

3. Position primers so as to surround the exons. Such primers, as before explained, 
arc small parts of (he letter code. i.e.. about 20 letters, that define the boundaries of the 
exons to be tested. Primers should be completely outside the exon. from 5 letters outside 
it up to 100 letters outside. 

Where required, as previously explained, it is allowed to split exons in two; that is, 
position four primers rather than two, so as to define an exon by two fragments rather 
than one (for very large exons even more fragments arc allowed). In such case, however, 
the fragments must necessarily overlap by about 10 tetters so as to assure that the exon is 
completely covered. At least one of the two primers defining a fragment should be 
coupled to a GC-clamp, 



WO 98/06872 24 PCT/IB97/00976 

(Melting Criterion 

4. Run each fragment defined by its primers on a conventional melting profile 

^ 3 

generating program such as the bcforc-tncntioncd Mclt87 (or a derivative) program so as 



to generate melting curves. Each fragment starts with the firs! letter of its left primer and 
ends with the last letter of its right primer: so it includes the GC-clamp. Ideally, each 
melting curve, as previously explained, should consist of one high-melting domain (the 
GC-clamp) and one low-melting domain (the exon or part of it); and when this is not the 
case, the program should try other primer positions and/or split exons in half, and/or 
change (he size of (he GC-clamp, and/or add two, instead of one GC-clamp. 
I'CR conditions 

5 . Compare the an nealing te mperature of the selected primers. They should be 
almost Ihc same (at leas! the primers of one pair). The annealing temperatures of primers 
can be determined by using the conventional PCR-dcsign program, 
f). Compare the primer letter codes among each other and with the total tetter code of 
the gene, so as to avoid overlap with any other than the defined fragments. 
Tuo-Dimeusional Distribution 

7. Distribute the fragments defined by their primers over two dimensions; one 
dimension being the size (say, left to right) and the other dimension being the melting 
temperature (say, top to bottom) of the fragment. Avoid clustering, though some is 
unavoidable. The easiest way to separate clustered fragments is to change their size, 
which can be done by changing Ihc position of the primers and/or changing Ihc size of the 
GC-clamp (a 5-IU-lcltcr change is usually enough to separate fragments of the same size). 
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Select a umdicnt of increasing melting temperature .so as to allow an optimal spread ofthc 
fragments from top to bottom ofthc gel. 
RBI Example 

As a first example, the above steps will mow be schematically translated into the 
scheme, programmed under "Windows", for the before-mentioned tumor-supprcssci gene 
RBI. a large gene, and with a lest design that requires extensive optimization. Like many 
other genes, the sequence of this gene is available in Gcnbank from which it may be 
retrieved and copied into a Hie to be subjected to the procedure representing the invention. 
First, the start and endpoints of the exons or coding sequences of this gene arc indicated 
by typing in the respective sequences until the program indicates uniqueness on the 
display. Then, the system starts by assuming 18-mcr primer, gaps between primers, and 
exotis of 30 bp and a denned GC-clamp of 30 bp long. A melting profile of this 
configuration as well as the PCR annealing temperatures ofthc primers is automatically 
obtained. Either by observation of the screen or automatic monitoring; a decision is taken 
about the suitability ofthc melting curve. If not entirely satisfactory, one or more ofthc 
following parameters arc varied: gap-size, primer-size, GC-clamp, until a satisfactory 
profile and set of annealing temperatures is obtained. If this proves to be impossible, the 
fragment is split in two or more fragments until optimal melting behavior is attained. 

Referring more specifically to the previously mentioned Flow Chart of Figs. 7A 
and 7B, the illustrative RB gene is located (2 1 ) and file copied (22). The primer window. 
Fig. 9, is activated (23), injecting a trial primer letter sequence. The trial GC sequence is 
modified at (26), if requited. Automatic monitoring can cause variation ofthc primer start 
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and cntl positions as required (2R. 29) and doing so while monitoring uniqueness (30). 
inciting temperature mapping, and display separation at Migration Ch, Fig. 9. 

Jf proper trial values arc available, the melting profile generation is checked (32), 
mid the graph viewed (33) to insure n satisfactory single domain over (he gene fragments. 
If (here arc no more exons (34), the resulting primer tabic for nil exons, including their 
exon numbers, GC clamp locations, proper letter codes, total length of letter code, and 
Tin values arc reported by print out (35). completing the test design (36). 

In the event, however, that a warning is highlighted (31 ) of any of non-unique 
storting and/or trailing primers, primer annealing temperatures, or spot migration 
crowding or overlap, and/or in the event more exons arc present (34), repetition of 
variation of exon numbers (24) and GC clamp sequences (2fi) is effected to provide proper 
values or conditions. 
Automatic Version 

For fully automatic test design where the setting of trial values is effected by 
known types of feedback control in response to deviation from rules for a standard single 
domain melting profile, for example, predispos ed exons arc imported in the fil cjicader 
when enabled (42), Fig. RA, after automatic features (40) and the gene file (4 1 ) have been 
establi shed. As previously discussed, fragments less than, say, 500 long arc selected (43), 
and .short close exons arc combined (44). If desired, optimum break points on the melting 
curve of the entire gene may be located and/or unique lengths of fragments selected. Trial 
printers arc then inputted for all exons (45) which, from experience, preferably start, for 
example, with each equal to I R base pairs long, and with 30 base pair lengths for split 
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ends. The annealing temperatures arc Ihcn plotted (46) with selection (47) of the highest 
values, and the increase of all other primer lengths to achieve substantially the same 
annealing temperature (4R). As before described, the primer lengths arc adjusted (49). 
Pig. RB. for uniqueness, consistent with substantially common annealing temperatures - 
say, for example, within 5°C of one another. 

Starling with a first exon fragment (50), a prc-sct number (such as six) trial melting 
graphs arc run for three types of trial GC clamps (5 1 ), and the closest fit to the desired 
standard single domain profile shape (52) is selected (53), with the fit being measured, for 
example, by one of RMs or sigma deviation from a rectangle. If the profile design rules 
are not met, an a rot flag is raised or highlighted. For additional exons (53). steps (5 1 ) 
and (52) arc repeated. If the exous meet minimum display spread migration rules at 
migration Ch, Fig. 9, the design is complete (57). If, however, any exons violate minimum 
display spread migration nilcs (54), it is determined whether another melting profile can be 
selected to meet the required domain and spot migration spread rules (55); and, if not, 
different GC clamp lengths arc tried or an error flag again raised (56). 
P. 53 Gene Example 

In the successive displays of Figs. 10-14, another example is presented of the 
carrying out of the procedures of the invention for another impairment gene- -the 1*53 
gene. 

Starting with a number 5, fragment I, Fig. 10, for a gene sequence ttcccatcaag 
ccclagggctcct ("gene view" box), Irial values oTcxon length 184, gaps 30 leading and 
trailing primer lengths 18. and annealing temperature of 67.3, and no GC clamp, the multi- 
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domain unsatisfactory melting temperature profile was produced at the lower left. Adding 
a right-hand GC clamp of 30, Fig. 1 1 A. did not adequately improve the melting profile. 
Substituting a 30 GC clamp at the left provided some improvement in reducing the 
multiple domain excursions. Fig. 1 1 B. Reducing the gap length to 10, Pig. 12, however, 
provided a satisfactory flat domain following the GC clamp left-hand peak. 

As before described, tables of primer sequences arc displayablc with predicted 
average melting temperature as an expanded View Primer Tabic. Fig, 13: and the design- 
predicted resulting 2-D clcctrophorctic fragment spot "Migration Ch" chart may be 
displayed and enlarged as in Fig. 14. 

Other arrangements of display boxes and presentations arc. of course, possible 
with the same general underlying software programming concepts and methodology of the 
invention, and further modifications will occur to those skilled in this art, such being 
considered to fall within the spirit and scope of the invention as defined in the appended 
claims. 
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What is claimed is: 

I . A computer-assisted method for generating the design of optimized and 
comprclicnsivc genetic DNA diagnostic tests based on PCR/2-D electrophoresis for 
screening mutations in genes, that comprises, inputting a computer microprocessor with a 
desired exon fragment ATGC letter sequence; programming and controlling the 
microprocessor Inst to select ; i trial stm t and end of the sequence: then, selecting and 
positioning trial primer letter pairs to define the opposite boundaries of the exons to be 
tested; adding a trial GC clamp to at least one primer; generating trial melting profiles for 
the trial primers and GC clamp to determine whether the profile consists of a relatively 
high-mclting GC clamp peak and a substantially single flat domain of relatively low- 
melting value for the fragments; in the event that the trial profile has multiple and not a 
satisfactory single flat domain, selecting one or both of other trial primer sequences and 
GC clamp sizes and generating said profiles therefrom until such a satisfactory single fiat 
domain is attained, and for primers of each pair having substantially similar annealing 
temperatures suitable for I'CR annealing: comparing the primer letter sequences with other 
known gene sequences in the context of the total letter sequence of the gene exon to 
insure the uniqueness of the sequences and to avoid overlap with any other than the 
selected gene fragments: simulating a 2-D distribution of the fragments defined by the 
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primer pairs by separation along one dimension by size and along sin orthogonal dimension 
by melting temperature or the fragment; selecting a gradient of increasing melting 
temperatures along the orthogonal dimension to allow an optimal spread of tlic Fragments 
thcienloJig: and reporting the selected primer sets and temperature gradient, thereby 
providing on optimal design Tor an actual PCR/2-D electrophoresis test. 

2. A method its claimed in claim I and in which, in the event of indicated clustering of 
the fragments along the one dimension, changing their size by changing the position of the 
primers and/or changing the size of the GC clamp to separate fragments of similar size. 

3. A method as claimed in claim I and in which the sequence of the gene is obtained 
from a data base such as Gcnbntik and is copied into a file for use in carrying out said 
method. 

4. A method as claimed in claim I and in which, if a (rial melting profile is obtained 
that docs not have a satisfactory single domain, varying one or more of primer size, gap 
size between exons, and GC clamp size until a satisfactory profile and set of annealing 
temperatures is obtained. 

5. A method as claimed in claim A and in which the varying of the size of the GC 
clamping size is effected by using two instead of one GC clamp. 

6. A method as claimed in claim 1 and in which the annealing temperatures of the 
primers arc determined by a PCR-dcsign program. 

7. A method as claimed in claim 1 and in which, if a satisfactory melting profile is not 
obtained, the exon fragment sequence is split into parts. 
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ft. A method ;is claimed in chitin 7 ;md in which (he exon fragment sequence is split in 
(wo with two pairs of primers so as to define the exon by two fragments with sonic letter 
overlap so as to assure that the exon is completely covered. 

9. A method as claimed in claim I and in which the primer pair letter sequences that 
define the boundaries of the exons lo be tested arc positioned outside the exon letter 
sequences and arc small compared with the exou letter sequences. 

10. A method as claimed in claim 9 and in which the primer letter sequences arc of the 
order of about 20 letters. 

11. A method as claimed in claim 9 and in which the primers arc positioned from about 
5 to up to about 1 00 letters outside the exon letter sequences. 

1 2. A method as claimed in claim t ;iud in which the melting profile is displayed to a 
monitoring operator on a computer display screen, and the selecting of each of start and 
end of the exon fragment Ictlcr sequences, primer pair letter sequences and GC clamp arc 
manually entered by the operator and displayed on the screen. 

1 3. A method as claimed in claim 1 and in which the characteristics of the melting 
profile arc automatically monitored, and start end letter sequence, primer pair letter 
sequence and GC clamp arc automatically varied as required until a predetermined 
satisfactory single domain melting profile is attained. 

14. A rnctftod as claimed in claim I and in which the fragment distribution is displayed 
on a computer display screen. 
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15. A method as claimed in claim 14 and in which, upon an operator monitoring the 
fragment distribution display screen and noting undue fragment clustering, the operator 
enters wirialtoiis in one or more of the position of the primers and the size or the GC 
clamp to separate lite fragments* and to display the changes on the screen. 

16. A method as claimed in claim 14 and in which the fragment distribution is 
automatically monitored, and upon detection of undue fragment clustering, varying one or 
more of the position oftlic primers and the size of the GC clamp to separate the 
fragments. 

17. A method as claimed in claim I and in which the temperature gradient distribution 
is displayed upon a computer display screen. 

18. A method as claimed in claim I and in which an actual PCR/2-D electrophoresis is 
carried out with nn actual gene corresponding to said desired gene using the reported 
primer sets and temperature gradient. 

19. A method as claimed in claim 18 and in which the actual PCR operation is carried 
out after adding primer pairs to successive groups of the gene exons by effecting long- 
distance PCR amplifications thereof in a common tube or vessel to achieve relatively long 
resulting amplicons; then adding further primer pairs to each of the gene exons and 
effecting multiplex PCR amplifications thereof in the common tube or vessel with 
relatively short resulting amplicons: and then 2-D clcclrophotclically separating tlic gene 
fragments. 

20. A method as claimed in claim 1 and in which the reporting comprises visual display 
upon n computer display screen, under the control of the programmed microprocessor, of: 
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a) the simulated profiles of (he inciting temperature for the gene fragments: 

b) 2-D maps of the simulated melted fragments; 

c) a tabic of the primer sequences; 

d) readout of the annealing temperatures of the primers: 

c) verification of the uniqueness of the prima -gene sequences; and 
0 continuous display of the gene sequence during the whole procedure. 

21. A method as claimed in claim 20 and in which the melting temperature is presented 
as a percentage of urca-formamidc. 

22. A computer-assisted method for generating the design of optimized nud compre- 
hensive genetic DNA diagnostic testy based on PCR/2-D electrophoresis for screening 
mutations in genes, that comprises, programming and operating a computer microprocessor 
to input a desired gene exon fragment ATCG letter sequence: indicating the start and end 
of the desired sequence; generating simulated optimal conditions for PCR primer annealing 
sequences; generating an optimal electrophoresis 2-D fragment separation simulation 
including a simulated melting profile by varying trial fragment sequence size. GC clamp 
sequence size and primer sequence size to achieve a single domain profile of relatively low 
melting temperature for the gene fragments, clamped by a relatively high melting 
temperature sequence; and reporting the selected primer sets for being synthesized. 

23. A method as claimed in claim 22 and in which the reporting comprises displaying 
said primer and GC clamp sequences, melting temperature values and 2-D fragment size 
and melting temperature profile distribution on a computet display screen. 
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24. A method as claimed in claim 23 mid in which (he sequences and said values arc 
displayed in window or box type displays.. 

25. A method as churned in claim 23 and in which Ihc letter size variations arc effected 
by an operator viewing the trial values in window or box type displays on the display 
screen. 

26. A method as claimed in claim 22 and in which the said varying is effected 
automatically in response to prc-sct melting profile characteristics. 

27. A method as claimed in claim 22 and in which the variations arc effected in 
accordance with Ihc steps of the Flow Charts herein, 

28. Computer-aiding apparatus for generating the design of optimized and 
comprehensive genetic DNA diagnostic tests based on PCR/2-D electrophoresis Tor 
screening mutations in genes, having, in combination, a computer microprocessor being 
software-programmed to effect Ihc steps of method claim 20. 

29. Computer-aiding apparatus as claimed in claim 28 and in which the steps of the 
programming include those of method claim 1 . 
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FIG. 7 A 



BEGIN 



i 



7 



LOCATE GENE FILE BY OPENING NENTREZ, SELECT DATA BASE 
NUCLEOTIDE. SEARCH FOR GENE, e.g. RETINOBLASTOMA- 
LOCATE RB GENETIC FILE L11910. 
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COPY THE FILE TO GENE.BXT 

— i — : 
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CLICK ON 'PRIMER' ICON TO RUN NEW 'PRIMER' PROGRAM 



I 
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MODIFY DESIRED EXON NUMBER, MODIFY DESIRED EXON 
STARTING POSITION NUMBER IF DESIRED. 
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MODIFY DESIRED EXON ENDING POSITION NUMBER IF DESIRED. 



25 



— MODIFY GC CLAMP SEQUENCE IF DESIRED. 
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z 



27 



CUCK ON 'AUTOMATIC* TO AUTOMATICALLY DETERMINE PRIMERS 

IF DESIRED. 

NOTE: THIS IS IMPUMENTED IN FIGS. 3A AND 3B. 
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r 



MODIFY DESIRED STARTING PRIMER STARTING POSITION NUMBER. 



YES 



29 



MODIFY DESIRED ENDING PRIMER ENDING POSITION NUMBER. 



FIG. 7B 



CLICK ON 'CHECK UNIQUENESS. MELT TEMPS, 
MIGRATION SPREAD 
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<' 

ARE ANY OF THE FOLLOWING WARNINGS HIGHLIGHTED? \ 
NON-UNIQUE STARTING PRIMER, NON-UNIQUE TRAILING \ 
PRIMER. PRIMER ANNEALING TEMPERATURES ARE I 
DIFFERENT (CAN VIEW TEMPERATURES), MIGRATION / 
OVERLAP (CAN CHECK MIGRATION CHART)? / 



NO 



CLICK ON 'RUN MELTING PROFILE*. 
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VIEW GRAPH, VIEW TEMPERATURE LIST, 
DOMAIN OK? 



YES 



YES 




NO 



33 



ANY MORE EXONS? 



x 
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NO 
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PRINT PRIMER TABLE FOR ALL EXONS INCLUDING 
EXON §, GC CLAMP LOCATION, PRIMER LETTER 
CODE, TOTAL LENGTH OF LETTER CODE, 

Tm (%UF). 
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DONE 
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READ WHICH AUTOMATIC FEATURES ARE ENABLED. 
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IMPORT NEW GENE FILE IF ENABLED. 
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USE PREDEFINED EXONS IN IMPORTED FILE 
HEADER IF ENABLED. 
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SPLIT EXONS INTO FRAGMENTS LESS THAN 
500 LONG, MAY WANT TO DO MELT CURVE 
ON ENTIRE EXON TO TRY TO FIND OPTIMUM 
BREAK POINTS, (MAY ALSO WANT TO BREAK 
THEM INTO UNIQUE LENGTHS). 
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COMBINE SHORT CLOSE EXONS TOGETHER. 



y 
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CREATE PRIMERS FOR All EXONS, START 
WITH EACH EQUAL TO 18 BASE PAIRS LONG, 
EXCEPT ON SPLIT ENDS START WITH 30. 



i 
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PLOT THE ANNEALING TEMPERATURES. MAY 
HAVE TO MULTIPLEX(USE TWO OR MORE 
ANNEAUNG TEMPERATURES), ESPECIALLY 
IF SPLITTING FRAGMENTS. 



i 



46 



CHOOSE A TARGET ANNEAUNG TEMPERATURE 
THAT IS THE HIGHEST ANNEALING TEMPERATURE 

OF ALL PRIMERS. 
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INCREASE LENGTH OF ALL OTHER PRIMERS 
TO HAVE SAME ANNEAUNG TEMPERATURE. 
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ADJUST PRIMER LENGTH SO IT IS UNIQUE AND STILL 
WITHIN 5 DE6 C OF TARGET ANNEALING TEMPERATURE 



49 



START WITH EXON 1 FRAGMENT 1, OR WHICH 
EVER EXON IS ENABLED. 
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i 



z 



51 



GENERATE 6 MELTING GRAPHS (THREE TYPES OF GC 
CLAMPS TRIED ON BOTH LEADING AND TRAILING 
ENDS OF EXON PRIMERS). 



CHOOSE BEST ONE BY ASSIGNING A FIT MERIT RATING 
TO ALL GRAPHS (RMS OR SIGMA DEVIATION FROM A 
RECTANGLE), RAISE AN ERROR FLAG IF MELTING 
DOMAINE DOES NOT MEET ALL RULES. 



YES 
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ANY MORE EXONS? 




r 



53 



NO 



DO ANY EXONS VIOLATE MINIMUM 
SPREAD MIGRATION RULES? 




YES 




54 



CTED V- 



CAN ANOTHER MELTING GRAPH BE SELECTED 
FROM THE MERIT RATINGS THAT MEETS ALL 1 
MELT DOMAINE AND MIGRATION SPREAD J 

RULES? 'y f 



55 



NO 
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L 



DONE 



FIG. 8B 
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L 



56 



TRY A DIFFERENT GC CLAMP LENGTH OR RAISE AN 
ERROR FLAG AND SPLIT THE FRAGMENT AGAIN 

IF ENABLED. 
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